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Abstract 

The understanding of the complex genotype-phenotype architecture of human pigmentation has clear implications for the 
evolutionary history of humans, as well as for medical and forensic practices. Although dozens of genes have previously 
been associated with human skin color, knowledge about this trait remains incomplete. In particular, studies focusing on 
populations outside the European-North American axis are rare, and, until now, admixed populations have seldom been 
considered. The present study was designed to help fill this gap. Our objective was to evaluate possible associations of 18 
single nucleotide polymorphisms (SNPs), located within nine genes, and one pseudogene with the Melanin Index (Ml) in 
two admixed Brazilian populations {Gaucho, N = 352; Baiano, N = 148) with different histories of geographic and ethnic 
colonization. Of the total sample, four markers were found to be significantly associated with skin color, but only two 
{SLC24A5 rsl426654, and SLC45A2 rsl6891982) were consistently associated with Ml in both samples {Gaucho and Baiano). 
Therefore, only these 2 SNPs should be preliminarily considered to have forensic significance because they consistently 
showed the association independently of the admixture level of the populations studied. We do not discard that the other 
two markers {HERC2 rsn29038 and TYR rsl 126809) might be also relevant to admixed samples, but additional studies are 
necessary to confirm the real importance of these markers for skin pigmentation. Finally, our study shows associations of 
some SNPs with Ml in a modern Brazilian admixed sample, with possible applications in forensic genetics. Some classical 
genetic markers in Euro-North American populations are not associated with Ml in our sample. Our results point out the 
relevance of considering population differences in selecting an appropriate set of SNPs as phenotype predictors in forensic 
practice. 
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Introduction 

In humans and other animals, pigmentation results from 
melanin synthesis, which occurs in melanocytes stored in vesicles 
known as melanosomes. Melanocytes are found in the epidermis, 
hair bulb, and iris, and are directly responsible for the 
pigmentation of these organs and structures [1,2]. The color 
spectrum observed in human eyes, hair, and skin is related to the 



type and quantity of melanin produced, to the size, number, and 
pattern of packaging of melanosomes, and to the pH of the 
melanosomal environment [3,4]. There are two main melanin 
types; eumelanin and pheomelanin, which are responsible for the 
black/brown and yellow/red spectra, respectively. 

The human pigmentation pathway is regulated by a network 
controlled by many genes, as well as by environmental, 
mechanical, and epigenetic factors. For instance, Quillen et al. 
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[5] identified 76 skin pigmentation candidate genes from the 
Online Mendelian Inlieritance in Man (OMIM) database using 
the following key-words: "color" and "pigmentation". This gene 
set included both classical and non-classical genes. Candidate 
pigmentation genes identified to date act over different stages of 
the above-mentioned processes [6]. Additional information about 
the function, localization, number of exons, and other relevant 
data related to these and others genes can be found in Cerqueira et 
al. [7,8]. 

A considerable number of genetic studies involving normal and 
pathological variation in human pigmentation have emerged 
recently [5,8-19]. Some of these investigations showed that it is 
possible to predict human color phenotypes from genotype data 
with different levels of accuracy, highlighting the importance of 
these types of studies for forensic practices [8,16]. Furthermore, it 
has been suggested that some variants associated with pigmenta- 
tion also confer susceptibility or resistance to skin cancer 
[12,20,21], since the presence of various skin tones is attributed 
to adaptation to diverse environments after dispersal of the 
anatomically modern Homo sapiens from Africa to other continents 
[9,13-15,22-25]. 

Additionally, these studies documented the extensive diversity of 
pigmentation genotype-phenotype architecture across human 
populations. For example, Quillen et al. [5] showed that in 
addition to the classical genes SLC24A5 and SLC45A2, others, such 
as OPRAH and EGFR, have also contributed to differences in 
pigmentation between Native Americans and Europeans. More- 
over, Norton et al. [6] suggested that polymorphisms in SLC24A5, 
SLC45A2, and TTR have a predominant role in the evolution of 
lighter skin color in Europeans, but not in East Asians. This 
indicates recent convergent evolution of lighter pigmentation 
phenotypes, and is suggestive the importance of natural selection 
in this process. Quillen and Shriver [15] evaluated skin color 
architecture at the population level and summarized the situation 
as follows: (a) despite gene flow among human populations, 
differences in allele frequencies for pigmentation genes are 
nonetheless observed; and (b) signatures of natural selection in 
some of these genes are observed among human populations, 
whereas in other genes, the signal is population-exclusive. 

As noted above, several studies have indicated different 
populations have different genetic backgrounds for pigmentation 
genes. Therefore, caution should be exercised when extrapolating 
results from one population to another. The gene pools of 
contemporary admixed Brazilian populations present varying 
levels of contribution from continental ancestral groups (Native 
American, European, and African) reflected in the striking 
differences that have been reported among them [26,27]. 
Nonetheless, few studies have examined polymorphisms in 
pigmentation candidate genes in these Brazilian populations to 
date [28] . Accordingly, the main objective of the present work was 
to evaluate the possible association between 18 single nucleotide 
polymorphisms (SNPs), located in nine candidate pigmentation 
genes and one pseudogene, and the Melanin Index (MI) in two 
Brazilian populations with different admixture levels. 

Materials and Methods 

Recruitment of study samples 

This study recruited a subset of volunteers from the 1,600 
Brazilian participants in the Consortium for the Analysis of the 
Diversity and Evolution of Latin America (CANDELA, http:// 
www.ucl.ac.uk/silva/ candela/ As a whole, the CANDELA project 
recruited 7,500 subjects over 18 years of age from five Latin 
America countries (Brazil, Colombia, Chile, Mexico and Peru). 



For this investigation, we worked with all available samples to date 
from persons born in Rio Grande do Sul (RS) (n = 352), the 
southernmost state of Brazil, and from those born in the state of 
Bahia (BA) (ii=148). Our goal was to evaluate possible 
associations between 18 SNPs and MI in two populations with 
contrasting ancestry formation. 

The above-indicated states have distinct colonial histories: RS is 
characterized by more substantial European colonization (a 
significant number of non-Portuguese European immigrants 
arrived in RS in the 19* century), whereas African heritage is 
more visible in BA with respect to both phenotypic and cultural 
aspects. Apart from these widely recognized characteristics, other 
historical, demographic, and genetic particularities can distinguish 
these populations [26,27,29-33]. 

The traditional Gaucho culture has roots in the South American 
Pampa region, whose geographical areas cover parts of RS, 
Argentina, and Uruguay. However, in modern Brazil, the word 
Gaucho {Gaucho in Portuguese) is used to refer to anyone born in RS, 
and not only to those born in the Pampean region of this state. In 
the present study, we employ this term to refer to any volunteer 
from RS, whereas the term Baiano is used to identify individuals 
from BA. 

This project was approved by the Research Ethics Committees 
of the Uiiiversidade Federal do Rio Grande do Sul, Hospital de 
Cliiiicas de Porto Alegre, and the Universidade Estadual do 
Sudoeste da Bahia (Resolutions 18208/2010, 100565/2011, and 
212/2010, respectively). AH subjects signed an informed consent 
form approved by the above-mentioned ethics committees, in 
accordance with the Declaration of Helsinki. 

Ml measurements 

The CANDELA protocol involved the acquisition of data on 
physical appearance, including a quantitative measure of skin 
color using the Derma spectrometer (DSM II ColorMeter, 
CyberDerm Inc., USA), which provides the MI. In humans, 
values ranging from 20 to 100 reflect fairer to darker skin tones. 
MI was measured on the proximal medial portion of both arms to 
generate a single average value. This location was chosen because 
it captures the constitutive skin pigmentation (the basal quantity of 
melanin), with potentially little effect of sun exposure [5,34]. 

Genetic analysis 

DNA was extracted from total blood samples using the salting 
out method [35] or with the QIAamp 96 DNA blood Kit (Qiagen), 
and genotyping procedures were performed with KASP genotyp- 
ing assay (LGC Genomics: www.lgcgenomics.com). Eighteen 
SNPs were investigated: rsl524668 {ADAM17), rs4785763 
(AFG3L1, a pseudogene), rs6058017 [ASIP), rs 1129038 [HERC2), 
rsl805009, rsl805008, and rsl805007 [MCIRI rsl800407, 
rs 1800401, and rs 18004 14 {0CA2), rs 1426654 {SLC24A5, also 
identified as NCKXS), rs6867641, rs26722, and rsl6891982 
(SLC45A2, also identified as IvIATP), rs3750965 and rs3829241 
(TPCM21 and rsl042602 and rsl 126809 (located at the TYR gene). 
These polymorphisms were chosen because previous studies have 
linked them either directly or indirectly with the human 
pigmentation pathway (Table SI in File SI). 

Statistical analyses 

Allele frequencies were estimated by gene counting. Agreement 
of genotype frequencies with Hardy- Weinberg equilibrium (HWE) 
was tested using the "Utility programs for analysis of genetic 
linkage" [36]. Allele and genotype frequency differences were 
compared between Gaucho and Baiano samples using Pearson or 
Fisher exact tests. Student's i-test or the non-parametric Wilcoxon- 
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Mann- Whitney tests were used to compare MI, age and ancestry 
(European, African and Native American) between the Gaucho and 
Baiano samples. The assumptions of normality and homoscedas- 
ticity were tested using Kolmogorov-Smirnov and Levene's tests. 

Population structure due to admixture is a well-known 
confounding factor in association studies. To minimize this 
problem, the proportion of African, European and Amerindian 
ancestry was estimated using the 40 Ancestry Informative Markers 
(AIMs) studied in all CANDELA samples. Paschou et al. [37] 
identified 5,000 SNPs, highly informative for worldwide conti- 
nental ancestry estimation (http://www.cs.rpi.edu/~drinep/ 
HGDPAIMS/WORLD_5000_INFAIMs.txt). Allele frequencies 
in Native Americans are available for a fraction of these markers 
[38]. The 40 markers with the highest inter-geographical group 
allele-frequency differences, together with the lowest intra- 
geographical group variation were selected for use in the 
CANDELA analysis. ADMIXTURE software [39] was used to 
estimate the proportions of African, European and Native 
American ancestry, as well as their standard errors. More details 
about these AIMs and the admixture analysis can be found in 
Ruiz-Linares et al. [40]. We also estimated the correlation 
between 1/MI and ancestry proportion using Pearson's prod- 
uct-moment coefficient, to evaluate the influence of ancestry in 
our dataset. 

ANOVA (General Linear Models) were employed to test for the 
association of genetic polymorphisms with the melanin index (MI) 
by comparing these values across genotypes; Sidak-corrected 
multiple comparisons were performed when appropriate. The 
proportions of African and European ancestries were included in 
the model as covariates. Note that covariates should be included in 
the ANOVA only if they are associated with both dependent 
(melanin index) and independent (genotype) variables. We also 
included the group label [Gaucho vs. Baiano) as an additional 
covariate in the total sample analysis to assess whether the results 
were robust to this additional known level of population structure. 
A Bonferroni-corrected value threshold of 0.0028 was considered 
statistically significant for ANOVA tests. 

The statistical analyses were performed using WinPEPI version 
4.0 [41], or PASW (former SPSS) statistical package, version 18.0. 

Results 

Management of quantitative data and gene frequency 
analyses 

Since the assumptions for the ANOVA and t-tests were not 
satisfied for age and MI, we used 1/MI and I /age transforma- 
tions. Non-parametric methods were used when this transforma- 
tion was not enough to satisfy those assumptions. The mean age 
for the total sample (N = 500) was 25 (range 18-85) years, 168 
(33.6%) being males and 332 (66.4%) females. The Gaucho mean 
age (26; range 18-85) years diflFered significantly from the Baiano 
mean age (23; 18-62) (/<<0.00I). The mean MI of the total sample 
was 32.90 (range 22.04—76.40), while Gaucho and Baiano showed 
average MI values of 31.54 (22.04-76.40) and 36.68 (23.72- 
65.01), respectively. These results indicate that the Baiano group 
presents a darker skin color compared with the Gaucho group {p< 
0.001). 

In the Gaucho sample, genotype frequencies were in accordance 
with those expected by HWE, except for SLC24A5 rs 1426654 
{%^ = 11.051, ^<0.01), which showed a lower number of hetero- 
zygotes than expected. For the Baiano group, only the genotype 
frequencies of SLC45A2 rs 1 689 1 982 were not in HWE (x^ = 6.9 1 1 , 
p<Q.Q\), also due to a lower than expected number of 
heterozygotes. Allele and genotype frequencies were statistically 



different between the 2 populations for ASIP rs6058017, HERC2 
rsl 129038, SLC24A5 rs 1426654, and SLC45A2 rs 1689 1982 (Table 
S2 in FUe SI). 

Association between Ml and genotypes 

Aiming to evaluate possible population stratification due to 
admixture, we first tested the diflFerences between Gaucho and 
Baiano samples with respect to European, African, and Native 
American ancestries measured using the 40 Ancestry Informative 
Markers employed in the CANDELA Consortium. The two 
groups differed for African and European contributions (/)<0.001), 
as well as Native American ancestry (/) = 0.017). The percentage 
mean values are: Gaucho: European, 84.86; African, 6.99; and 
Native American, 8.15; Baiano: 61.47, 29.91, and 8.62, respectively 
(Table S3 in File SI). 

We also observed a moderate correlation between 1/MI and 
European (r = 0.516; p<Q.QQ\); and African (r = -0.598; /)<0.001) 
proportions; and a weak correlation between 1/MI and Native- 
American ancestry (r = -0.167; p<Q.QQl). However, only the 
African and European ancestries were associated with both 
genotypes and the melanin index, as opposed to the Native 
American ancestry. Sex was not associated with MI (/; = 0.8 1 7) and 
a negligible correlation (-0.089; = 0.046) was observed between 
MI and age. Therefore, in the ANOVA analysis of the association 
between the 18 genetic markers and MI, we used European and 
African proportions as covariates to adjust for population 
stratification. 

Three sets of ANOVA analyses were performed to evaluate the 
association between MI and genotypes for the 18 markers: for the 
total sample (N = 500), for the Gaucho subsample (N = 352), and for 
the Baiano subsample (N=148). The results are presented in 
Table 1. In the total sample, four polymorphisms showed a 
statistically significant association with MI: SLC24A5 rsl 426654 
(/)<0.001) and SLC45A2 rsl6891982 (/)<0.001); and HERC2 
rsl 129038 (/) = 0.001), and TYR rsl 126809 (/) = 0.001). But when 
the Gaucho and Baiano populations were considered separately, only 
two of these SNPs {SLC24A5 rsl426654 and SLC45A2 rsl6891982) 
remained significantly associated with MI (/)<0.001). The 
inclusion of the group label [Gaucho vs. Baiano) as an additional 
covariate in the combined population analysis did not change our 
results (data not shown). 

The average values obsei"ved for MI suggest that for loci HERC2 
rsl 129038, SLC45A2 rsl6891982 and TYR rsl 126809, the 
respective alleles A, C, and A, present a dominant effect 
(Table 2), while alleles A and G of the SLC24A5 rsl 426654 locus 
seem to have a codominant effect. Alleles HERC2 rsl 129038 G, 
SLC45A2 rsl6891982 C, SLC24A5 rsl426654 G, and TYR 
rsl 126809 G were associated with darker skin pigmentation 
(Table 2). 

Discussion 

Knowledge about the connection between genetic variation and 
complex traits has improved markedly in recent years. Neverthe- 
less, current knowledge of genetic predictors of skin color remains 
limited [42] , particularly with respect to populations outside of the 
European-North American axis. Our intention was to help fill this 
gap by demonstrating the effect of genetic variants for pigmen- 
tation in two admixed Brazilian populations. 

Our results showed that just two polymorphisms {SLC24A5 
rsl 426654 and SLC45A2 rsl 689 1982) seem to behave relatively 
consistently as skin color predictors in admixed samples. HERC2 
rsl 129038 and TYR rsl 126809 were significantly associated with 
MI in the total sample analyses only. We do not discard the 
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possibility that these two markers might also be relevant, but 
additional studies are necessary to confirm their real importance 
for skin pigmentation. It is possible that their association in Gaucho 
and Baiano disappeared due to small sample size when these 
populations were considered separately. On the other hand, some 
classical markers associated with skin pigmentation in Europeans 
and others populations (Table SI in File SI) are not associated in 
our admixed samples (for example, ASIP rs6058017, MCIR 
rsl805009, 0CA2 rsl800414, SLC45A2 rs6867641), indicating 
the possibility that they are population-specific. Therefore, they 
should be avoided for general inferences in forensic DNA 
phenotyping. This notable absence of association between classical 
skin pigmentation loci (including ASIP and MCIR) and skin color 
also was noted by Beleza et al. [43] studying Cape Verdean 
subjects with extensive West African/European admixture. On the 
other hand, similarly to us, they found a significant association of 
SLC24A5 rs 1426654 and SLC45A2 rs 1689 1982 with this trait. 

Pigmentation dilfers due to the amount and type of melanin 
synthesized in melanocytes, and to melanosome shape and 
distribution [3]. Complex gene and protein networks are related 
to these processes, and it is likely that their architectures differ, at 
least in part, across diverse human populations. Our findings have 
important forensic implications, since some authors have suggested 
that eye and hair color predictions can be developed indepen- 
dently of the biogeographic ancestry of the population investigated 
[17,18,44,45]. Results of the present study indicated that for skin 
color this assumption appears to be only partially true. 

Despite the fact that we observed an intermediate efiect of 
heterozygote genotypes in the majority of significant SNPs, it is 
clear that heterozygotes for some polymorphisms had MI means 
similar to those of one of the homozygotes. The description of the 
specific elfect of each genotype is therefore extremely important. 
For instance, HERC2 rsl 129038 and TYR rsl 126809 suggest a 
dominant relationship between alleles, as found by Cook et al. [46] 
in a study of the expression of pigmentation genes across different 
genotype profiles. SNPs ?,LC45A2 rsl 689 1982 and SLC24A5 
rs 1 426654 were previously associated with skin color in European 
and/or Asian populations [6,17,18,46-48], and our results 
indicate similar association in admixed populations. On the other 
hand, HERC2 rsl 129038 was previously associated with eye and 
hair color only [49,50]. This polymorphism is in high linkage 
disequilibrium with HERC2 rsl 29 13832 [50,51], which, in turn, is 
associated with eye, hair, and skin color [46,50], and seems to have 
an on-oflF effect on the 0CA2 gene function [52,53]. Our study 
shows, for the first time, that HERC2 rsl 129038 can have a role in 
skin color, at least in some populations, such as those investigated 
here (Total sample; Table 1). 

Interestingly, some studies have suggested that the SLC45A2 
rsl 689 1982 C allele was associated with darker skin and/or brown 
eyes, whereas in other investigations, this allele was associated with 
lighter skin and/or blue eyes (Table SI in File SI). This 
discrepancy may be due to the fact that C/G diflFerences can 
easily lead to reading mistakes. Our data indicated that the C allele 
(Phe) was related to higher MI values, thus corroborating the 
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In general, the effects described here for the SNPs statistically 
associated with skin color are in agreement with their function- 
ality, as described in the literature. For instance. Ginger et al. [54] 
demonstrated that the expression of the SLC24A5 gene product is 
required for the production of melanin in differentiating human 
epidermal melanoblasts. Also, a functional study conducted by 
Sturm [55] found that in SLC24A5*! 1 lAIa homozygotes the 
amount of melanin in cultured melanocytes increased by 2.2 times, 
as compared to ThrlllThr. More information about the biological 
effects of other SNPs studied here in human pigmentation may be 
viewed in Table S 1 in File S 1 . 

Conclusion 

The main imphcation of our study is the establishment of some 
genetic markers as skin color predictors in admixed populations. 
Two SNPs {SLC24A5 rsl 426654 and SLC45A2 rsl 689 1982) were 
significantiy associated with MI in all analyses. Two other markers 
showed suggestive results {HERC2 rsl 129038 and TYR 
rsl 126809), indicating the need for additional studies for 
confirmation. An interesting finding is that while some polymor- 
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seem to be population-specific, indicating that genotype-pheno- 
type relationships found in one population cannot be uncritically 
extrapolated to another. 
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